May 26, 2025

Transform data from wide to long format using R

Often, when we enter data into a spreadsheet, we use the wide format, where repeated measurements of a variable are spread across columns. But when we perform longitudinal analyses, we need to transform the data to the long format.

Sometimes I forget how to do this in R, so I wrote a tutorial on using the pivot_longer() function to transform data from the wide to the long format in preparation for longitudinal data analysis. The tutorial is located on my RPubs page.
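As a quick reminder of the idea, here is a minimal sketch (not the code from the tutorial). The data frame `wide` and its column names (`id`, `score_1` to `score_3`) are made up for illustration; `pivot_longer()` comes from the tidyr package.

```r
library(tidyr)

# Hypothetical wide data: one row per subject, one column per visit
wide <- data.frame(
  id      = 1:3,
  score_1 = c(10, 12, 9),
  score_2 = c(11, 14, 9),
  score_3 = c(13, 15, 10)
)

# Stack the score_* columns into one row per subject per visit
long <- pivot_longer(
  wide,
  cols         = starts_with("score_"),
  names_to     = "visit",    # new column holding the visit number
  names_prefix = "score_",   # strip the prefix from the old column names
  values_to    = "score"     # new column holding the measurements
)

# The visit number is read in as text, so convert it to an integer
long$visit <- as.integer(long$visit)

long
```

Each subject now has three rows, one per visit, which is the layout that most longitudinal modeling functions expect.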

Mark Bounthavong

April 30, 2025

R Programming, Statistics & Probability

Generate data using the simstudy package in R

There are times when you are looking for a dataset to test code or a formula, but suitable datasets are hard to find or are not publicly available. To get around this problem, we can generate our own data. R provides several tools for us to accomplish this.

I wrote a short guide on how to generate data using the simstudy package in R. You can read how to do this on my RPubs site (link).
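To give a flavor of the approach, here is a minimal sketch that uses the `defData()` and `genData()` functions from simstudy. The variable names (`age`, `female`, `sbp`) and all of the numbers are assumptions chosen for illustration, not values from the guide.

```r
library(simstudy)

set.seed(1234)

# Define each variable: name, distribution, and mean formula
def <- defData(varname = "age", dist = "normal", formula = 50, variance = 100)
def <- defData(def, varname = "female", dist = "binary", formula = 0.5)

# A later variable can depend on variables defined earlier
def <- defData(def, varname = "sbp", dist = "normal",
               formula = "100 + 0.5 * age - 3 * female", variance = 25)

# Generate 500 simulated individuals from the definitions
dat <- genData(500, def)

head(dat)
```

Because the data-generating process is known, we can check whether a model recovers the coefficients that we built in.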

Mark Bounthavong

March 30, 2025

Epidemiology, Methods, R Programming

Medication adherence estimations using R - Part 1

I created a tutorial on how to use the AdhereR package in R to estimate the medication adherence rate for a sample of individuals with prescription claims data. I posted the tutorial on my RPubs page (link).

The two most common medication adherence measures are the Medication Possession Ratio (MPR) and the Proportion of Days Covered (PDC). This tutorial reviews how to estimate these medication adherence rates using AdhereR in R.
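To make the difference between the two measures concrete, here is a minimal base R sketch for a single hypothetical patient. It does not use AdhereR, and the claims and observation window are made up. It assumes the simplest definitions: MPR is the total days supplied divided by the days in the window, and PDC is the number of distinct days with medication on hand divided by the days in the window, with overlapping supply counted once rather than carried forward.

```r
# Hypothetical prescription claims for one patient
claims <- data.frame(
  fill_date   = as.Date(c("2024-01-01", "2024-01-25", "2024-03-10")),
  days_supply = c(30, 30, 30)
)

# Hypothetical observation window (inclusive of both end points)
window_start <- as.Date("2024-01-01")
window_end   <- as.Date("2024-04-09")
window_days  <- as.numeric(window_end - window_start) + 1

# MPR: total days supplied divided by days in the window
mpr <- sum(claims$days_supply) / window_days

# PDC: list every day covered by each fill (as day numbers) ...
covered <- unlist(lapply(seq_len(nrow(claims)), function(i) {
  as.numeric(claims$fill_date[i]) + seq_len(claims$days_supply[i]) - 1
}))

# ... count each day once and keep only days inside the window
covered <- unique(covered)
covered <- covered[covered >= as.numeric(window_start) &
                   covered <= as.numeric(window_end)]
pdc <- length(covered) / window_days

c(MPR = mpr, PDC = pdc)
```

The second fill overlaps the first by six days, so the MPR counts those days twice while the PDC counts them once. This is why the PDC can never exceed 1, whereas the MPR can.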

Mark Bounthavong

February 26, 2025

Econometrics, Epidemiology, MEPS, Methods, R Programming, Statistics & Probability

Propensity score matching in R

I wrote an introductory tutorial on how to perform propensity score matching using R, which has been posted on my RPubs site (link).

Propensity score matching is a statistical approach to balancing the observed covariates between groups. In observational studies, this method can mitigate confounding by observed covariates and bring us closer to making causal interpretations. However, there are a lot of approaches and nuances. This introductory tutorial presents the basics of propensity score methods and how we can use them in our conventional analyses.
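As a minimal sketch of the basic workflow, the example below uses the MatchIt package and its bundled `lalonde` dataset. Both are my choices for illustration and may differ from the package and data used in the tutorial. It performs 1:1 nearest-neighbor matching on a propensity score estimated with logistic regression.

```r
library(MatchIt)

# Example data shipped with MatchIt (an assumption; not the tutorial's data)
data("lalonde", package = "MatchIt")

# Estimate the propensity score with logistic regression and
# match each treated unit to its nearest control (1:1, without replacement)
m_out <- matchit(
  treat ~ age + educ + married + nodegree + re74 + re75,
  data     = lalonde,
  method   = "nearest",
  distance = "glm"
)

# Check covariate balance before and after matching
summary(m_out)

# Extract the matched sample for the outcome analysis
matched <- match.data(m_out)
```

The matched data frame includes a `weights` column and a `subclass` column identifying the matched pairs, which can be carried into the outcome model.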

Mark Bounthavong

December 25, 2024

Econometrics, Epidemiology, Methods, R Programming, Statistics & Probability

Prepost analysis with continuous data using R - Part 1

I wrote a tutorial on how to perform a simple prepost analysis using R, which is available on my RPubs page. It covers how to compare two differences (change in value before and after an intervention) using independent t-test and linear regression approaches. However, it doesn’t cover how to address correlation between two dependent values. Part 2 of prepost analysis will cover those issues.
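Here is a minimal base R sketch of the two approaches on simulated data. The variable names, sample size, and effect sizes are assumptions for illustration. It computes the change score for each individual and compares the mean change between groups.

```r
set.seed(42)

# Simulate pre and post values for a control (0) and intervention (1) group
n   <- 100
dat <- data.frame(
  group = rep(c(0, 1), each = n),
  pre   = rnorm(2 * n, mean = 50, sd = 10)
)
dat$post   <- dat$pre + 2 + 3 * dat$group + rnorm(2 * n, sd = 5)
dat$change <- dat$post - dat$pre

# Approach 1: independent t-test on the change scores (equal variances)
tt <- t.test(change ~ group, data = dat, var.equal = TRUE)

# Approach 2: linear regression of the change score on group
fit <- lm(change ~ group, data = dat)

tt
summary(fit)
```

With equal variances assumed, the two approaches give the same answer: the coefficient on `group` equals the difference in mean change between the groups, and the p-values match.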

Mark Bounthavong

September 28, 2024

R Programming

Tips and Tricks (Guide) with R and RStudio

I wrote a collection of tips and tricks (guide) for R and RStudio (link). This is a work in progress, and I plan to update it in the future.

Mark Bounthavong

August 25, 2024

Cost-effectiveness models, R Programming

Distributions in cost-effectiveness analysis

In cost-effectiveness analysis, we deal with uncertainty in our parameters by performing sensitivity analyses. In this article, I review how we can generate distributions for common parameters in a cost-effectiveness analysis. You can view the article on my RPubs page.
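As a minimal base R sketch of the general idea (not the code from the article), the example below draws parameter values for a probabilistic sensitivity analysis. The distribution choices follow a common convention that I am assuming here: beta for probabilities, gamma for costs, and lognormal for relative risks. The means and standard errors are made up, and the helper functions `beta_params()` and `gamma_params()` are hypothetical names for the method-of-moments conversions.

```r
set.seed(2024)
n_sim <- 10000

# Method of moments: convert a mean and standard error to beta parameters
beta_params <- function(mean, se) {
  k <- mean * (1 - mean) / se^2 - 1
  list(shape1 = mean * k, shape2 = (1 - mean) * k)
}

# Method of moments: convert a mean and standard error to gamma parameters
gamma_params <- function(mean, se) {
  list(shape = mean^2 / se^2, scale = se^2 / mean)
}

# Probability (bounded between 0 and 1): beta distribution
p <- beta_params(mean = 0.30, se = 0.05)
prob_draws <- rbeta(n_sim, p$shape1, p$shape2)

# Cost (positive and right-skewed): gamma distribution
g <- gamma_params(mean = 5000, se = 1000)
cost_draws <- rgamma(n_sim, shape = g$shape, scale = g$scale)

# Relative risk (positive, symmetric on the log scale): lognormal distribution
rr_draws <- rlnorm(n_sim, meanlog = log(0.80), sdlog = 0.10)

summary(prob_draws)
summary(cost_draws)
summary(rr_draws)
```

Each set of draws respects the natural bounds of its parameter, which is the main reason these distributions are preferred over a normal distribution.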

Mark Bounthavong

July 28, 2024

Econometrics, R Programming, Statistics & Probability

Staggered difference-in-differences using R

I was interested in learning how to apply the Callaway & Sant'Anna staggered difference-in-differences framework to my work. After reading several papers and watching the video by Sant'Anna, I wrote a short tutorial on how to apply this framework to simulated data. The tutorial is located on my RPubs site.

This method uses the R “did” package, which is based on the paper by Callaway & Sant’Anna.
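For orientation, here is a minimal sketch of the core calls in the did package. It uses the `mpdta` example dataset that ships with the package rather than the simulated data from the tutorial, so the column names below belong to that dataset.

```r
library(did)

set.seed(1)  # the default inference uses a bootstrap

# Example county-level panel included in the did package
data(mpdta)

# Estimate group-time average treatment effects
out <- att_gt(
  yname         = "lemp",         # outcome
  tname         = "year",         # time period
  idname        = "countyreal",   # unit identifier
  gname         = "first.treat",  # first treated period (0 = never treated)
  data          = mpdta,
  control_group = "nevertreated"
)
summary(out)

# Aggregate into an event-study (dynamic) summary
es <- aggte(out, type = "dynamic")
summary(es)
```

The `gname` column is what distinguishes the staggered design: it records when each unit was first treated, so units are compared within their own treatment cohort.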

Mark Bounthavong

June 23, 2024

Epidemiology, Econometrics, Methods, R Programming, Statistics & Probability

Mediation analysis using R

It’s not uncommon to see covariates in a regression model that should not be there. For example, measurements that occur after the treatment assignment are sometimes included in a regression model as baseline covariates. Instead, one should consider a mediation analysis.

I wrote a tutorial on how to perform mediation analysis using R on my RPubs site (link).

I know that I make this mistake at times. This tutorial helped me to carefully consider which covariates to include in a regression model and which ones to consider for mediation analysis.
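As a minimal sketch of what this looks like in practice, the example below uses the `mediate()` function from the mediation package on simulated data. The package choice, variable names, and effect sizes are assumptions for illustration and may differ from the tutorial.

```r
library(mediation)

set.seed(123)

# Simulate a treatment that affects the outcome directly and through a mediator
n        <- 500
treat    <- rbinom(n, 1, 0.5)
mediator <- 0.5 * treat + rnorm(n)
outcome  <- 0.3 * treat + 0.7 * mediator + rnorm(n)
dat      <- data.frame(treat, mediator, outcome)

# Mediator model: effect of treatment on the mediator
model_m <- lm(mediator ~ treat, data = dat)

# Outcome model: effect of treatment and mediator on the outcome
model_y <- lm(outcome ~ treat + mediator, data = dat)

# Decompose the total effect into indirect (ACME) and direct (ADE) effects
med <- mediate(model_m, model_y,
               treat = "treat", mediator = "mediator", sims = 500)
summary(med)
```

The post-treatment variable enters as a mediator with its own model instead of being treated as a baseline covariate, which is the distinction described above.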

Mark Bounthavong

April 28, 2024

R Programming

R - Loading data from Google Drive

Recently, a colleague contacted me to assist another colleague who was having trouble loading data stored in a Google Drive account into R. I had never thought about using Google Drive as a place to store data and then load it into the R environment. Normally, I store and load data from GitHub, but there are some limitations, particularly when the dataset is very large. Google Drive might be an easy workaround for this limitation, so I decided to figure out how to make this work.

I posted this tutorial on my RPubs site.
